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[~^. . Abstract 

We are interested in the asymptotic analysis of the binary search tree (BST) under the 
fVj random permutation model. Via an embedding in a continuous time model, we get new results, 

P I ■ in particular the asymptotic behavior of the profile. 
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I~2 1 Introduction 

O . 

This paper deals mainly with two classical models of binary trees processes: the binary search 
1 tree process and the Yule tree process. 

• A labeled binary search tree (LBST) is a structure used in computer science to store totally 
ordered data. At time the LBST is reduced to a leaf without label. Each unit of time, a new 
item is inserted in a leaf of the tree. This leaf is then replaced by an internal node with two leaves. 
We are interested in the sequence of underlying unlabeled trees (T n ) n induced by this construction. 
We call this sequence the binary search tree process, or BST process. 

• The Yule tree process (T£t)t is a continuous time (unlabeled) binary tree process in which each leaf 
behaves independently from the other ones (at time 0, the tree Tlo is reduced to a leaf). After an 
(random) exponential time, a leaf has two children. Due to the lack of memory of the exponential 
distribution, each leaf is equally likely the first one to produce children. 

Under a suitably chosen random model of data (the random permutation model), the two models 
of trees are deeply related. In the Yule tree process, let r n be the random time when the n + lth leaf 
appears. Under the random permutation model the link between the two models is the following 
one: the process (TL rn ) n has the same law as {T n ) n . This allows the construction of the BST process 
and the Yule tree process on the same probability space on which (T n ) n = (TT Tn ) n . This embedding 
of the BST process into a continuous time model allows to use independence properties between 
subtrees in the Yule tree process (it is a kind of Poissonization). Many functionals of the BST can 
then be derived using known results on the Yule tree. An interesting quantity is the profile of T n 
which is the sequence {Uk{n))k>o where Uk{n) is the number of leaves of T n at level k. Here, in 
H21j) . the martingale family {M n {z))n ~ the Jabbour's martingale - which encodes the profile of 
(T n ) n is shown to be strongly related to the martingale family (M t (z)) t that encodes the profile of 
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The aim of the present paper is to revisit the study of (M n (z)) n using the embedding. For 
z > 0, we recover very quickly the behavior of the limit A^oo(^) : positive when z € (z~,z^), 
zero when z £ [z~,z£]. In the critical cases z = z^r the behavior was unknown. We prove that 
•MooO^) = a.s. and get the convergence of the derivative. The limits M'^z) and M-oo(z) 
satisfy a splitting formula (jSQ)) which, for z = 1 gives the Quicksort equation (Corollary I3.6j) . Thus, 
the embedding method is the key tool for proving and enlarging convergence results on the BST 
martingale (Theorem I3.3j) and its derivative (Theorem 13. 5J) . See the companion paper JH| for 
complements. 

The paper is organized as follows. After the definition of the models in Section 2, we explore 
some consequences of the embedding. In particular, in 2.5.1 we exhibit a family of uniform random 
variables attached to the nodes of the Yule tree. These random variables give, for every node u, the 
limiting proportion of leaves issued from u among those issued from its parent. A similar property 
holds for the embedded BST. In subsection 2.5.2, the appearance of uniform variables as limiting 
proportion of leaves is explained on a LBST model. In Section El we study the convergence, as 
n — > 00, of the BST martingale M. n (z). 

Thanks to this method, we are able in Section 0] to describe the asymptotic behavior of the 
profile Uk{n) when k ~ 2zlogn in the whole range z G (z~,z+). Previously, the result was known 
only on a sub-domain where the L 2 method works (|15j). 

Finally, in Section [5J an other point of view is investigated. In biasing the evolution rules of 
the BST - a random line of descent is distinguished, and the evolution of the nodes belonging to 
this branch is different from the other ones - it appears that the behavior of the nodes on the 
distinguished line gives information on the whole tree. 

2 The models 

2.1 Binary search trees 

For a convenient definition of trees we are going to work with, let us first define 



the set of finite words on the alphabet {0, 1} (with for the empty word). For u and v in U, denote 
by uv the concatenation of the word u with the word v (by convention we set, for any u E U, 
0u = u). If v 7^ 0, we say that uv is a descendant of u and u is an ancestor of uv. Moreover uO 
(resp. ul) is called left (resp. right) child of u. 

A complete binary tree T is a finite subset of U such that 



The elements of T are called nodes, and is called the root ; \u\, the number of letters in u, is the 
depth of u (with |0| = 0). Write BinTree for the set of complete binary trees. 

A tree T £ BinTree can be described by giving the set dT of its leaves, that is, the nodes that 
are in T but with no descendants in T. The nodes of T\dT are called internal nodes. 

We now introduce labeled binary search trees (LBST), that are widely used to store totally 
ordered data (the monograph of Mahmoud gives an overview of the state of the art). 



U = {0}U(J{O,1} 
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Let A be a totally ordered set of elements named keys and for n > 1, let (xi, ...,» n ) be picked 
up without replacement from A. The LBST built from these data is the complete binary tree in 
which each internal node is associated with a key belonging to (x±, x n ) in the following way: 
the first key x\ is assigned to the root. The next key x<i is assigned to the left child of the root 
if it is smaller than x\, or it is assigned to the right child of the root if it is larger than x\. We 
proceed further inserting key by key recursively. After the n first insertion, one has a labeled binary 
tree in which n nodes own a label: these nodes are considered as internal nodes. One adds n + 1 
(unlabeled) leaves to this structure in order to get a labeled complete binary tree with n internal 
nodes. 




Figure 1: BST built with the sequence of data 0.5, 0.8, 0.9, 0.3, 0.4 (empty squares are leaves). 

To study the shape of these trees for large n, it is classical to introduce a random model. 
One usually assumes that the successively inserted data (xi)i>i are i.i.d. random variables with a 

_ ( F} 

continuous distribution F. Under this model, let us call the LBST L n , it is a random variable 
taking values in the set of complete binary trees in which each internal node has a label in [0,1]. 
The sequence is a Markov chain. 

We are, in fact, mainly interested in the underlying tree %i of L„ , i.e. the tree that has 

( F) 

the same arborescent structure of L\ , but that has no label. We set 

{T^ F \n > 0) := (UNDER(Lf >),n > 0); 

by construction Tn is a complete binary tree. 

For every n > 1, the string x±, ..,x n induces a.s. a permutation a n such that av n (i) < x a n {2) < 
• • • < x a f n y Since the a?j are exchangeable, a n is uniformly distributed on the set S n of permuta- 
tions of {1, ..,n}. Since this claim is not sensitive to F we will assume, for the sake of simplicity, 
that F is the uniform distribution on [0,1], and we write from now L n instead of L n ' and T n 

( F*\ 

instead of Tn . This is the so-called random permutation model. Again by exchangeability, a n is 
independent of the vector (x a ru, x (7n (2) ) • ■ ■ , x a n (n)) an d we have 

P(x n+1 € (x anU) ,X an{j+1) ) | (T n ) = P(x n+1 G (Xa n (j),X* n (j+l))) 

= P(a n+1 (j + l) =n+l) = (n+1)- 1 

for every j G {0, 1, .., n}, where a? CT „(o) := an d ^^(n+i) := 1- This relation ensures the consistency 
of the sequence (a n ) n . 

One can also express this property with the help of the sequential ranks of the permutation: 
the random variables = Sj=i ^-xj<x k , k > 1 are independent and R^ is uniform on {1, . . . , k} 
(see for instance Mahmoud section 2.3), so that P{R n+ \ = j + 1 1 Ri, .., R n ) = (n + 1)~ . 
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In terms of binary search tree, this means that the insertion of the n + 1st key in the tree with 
n internal nodes is uniform among its n + 1 leaves. In other words, in the random permutation 
model, the sequence (%i) n >o is a Markov chain on BinTree defined by To = {0} and 

T n+1 = T n U{D n 0,D n l}, 
P{D n = u\T n ) = (n + 1)- 1 , u£dT n ; (1) 

the leaf D n of T n is the random node where the n + 1-st key is inserted, its level is d n . 

The difference of the rule evolutions of L n (that depends deeply on the values x±, . . . , x n already 
inserted) and T n (that depends of nothing) is similar to Markov chain in random environment (L n 
is the quenched Markov chain and T n the annealed one). 

This Markov chain model is a particular case (a = 1) of the diffusion-limited aggregation (DLA) 
on a binary tree, where a constant a is given and the growing of the tree is random with probability 
of insertion at a leaf u proportional to \u\~ a (Aldous-Shields 1\, Barlow-Pemantle-Perkins [5]). 

Here are few known results about the evolution of BST. First, the saturation level h n and the 
height H n , 

h n = min{|u| : u G dT n } , H n = max{|u| : u £ dT n } (2) 

grow logarithmically (see for instance Devroye |17j ) 

h H 
a.s. lim —2- = c' = 0.3733... lim —2- = c = 4.31107... ; (3) 

— log n n^oo log 71 



71— >OC 



the constants d and c are the two solutions of the equation 772(2;) = 1 where 

x 

rf\(x) := x log — — x + A, x > , (4) 
A 

is the Cramer transform of the Poisson distribution of parameter A. Function 772 reaches its min- 
imum at x = 2. It corresponds to the rate of propagation of the depth of insertion: 2 \ogn — * ^ 
More precise asymptotics for H n can be found in [SHI) |H2> 
Detailed information on T n is provided by the whole profile 

U k (n) :=#{«£ dT n ,\u\ = k} , k > 1 , (5) 

that counts the number of leaves of T n at each level. Notice that Uk(n) = for k > H n and 
for k < h n . To get asymptotic results, it is rather natural to encode the profile by the so-called 
polynomial level Uk(n)z k , whose degree is H n . Jabbour |15|I25| proved a remarkable martingale 
property for these random polynomials. More precisely, for z ^ = {0, —1/2, —1, —3/2, • • • } 

and n > 0, let 

M n {z) := ^73 E U ^)* k = C17) E zH > ( 6 ) 
Un[z) k>o Un[z) uedT n 



where Cq{z) = 1 and for n > 1, 

71-1 

k + 1 y ~' \ n 
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and let J 7 ^ be the cr-field generated by all the events {u G ^~j}j<n,uev ■ Then (M n (z), Ti n \) n is a 
martingale to which, for the sake of simplicity, we refer from now as the BST martingale. If z > 0, 
this positive martingale is a.s. convergent; the limit Moo{z) is positive a.s. if z G (z~,z^), with 

z ~ = c '/2 = 0.186..., zt = c/2 = 2.155... (8) 

and Aioo(z) = for z [z~ , z+] (Jabbour [55]). This martingale is also the main tool to prove 
that, properly rescaled around 21ogn, the profile has a Gaussian limiting shape (see Theorem 1 in 

EH). 

2.2 Fragmentation, Yule tree process and embedding 

The idea of embedding discrete models (such as urn models) in continuous time branching 
processes goes back at least to Athreya-Karlin jl]. It is described in Athreya and Ney (0, section 
9) and it has been recently revisited by Janson |2B]. For the BST, various embeddings are mentioned 
in Devroye in particular those due to Pittel jHS]) and Biggins |12| I13j. Here, we work with a 
variant of the Yule process, taking into account the tree (or "genealogical") structure. 

First, let us define a fragmentation process (F(t)) t >o of the interval (0, 1) as follows: we set 
1% = (0, 1) and for u = u\U2---Uk G U, set I u the interval 

k k 

i=i j=i 

Hence, each element u of U encodes a subinterval I u of (0, 1) with dyadic extremities. 

We set F(0) = 1$ = (0, 1). An exponential t\ ~ Exp(l) random variable is associated with 
F(0). At time n, the process F. jumps, the interval (0,1) splits in the middle into two parts 
and F(ti) = ((0, 1/2), (1/2, 1)) = (Iq,I\). After each jump time r, the fragments of F(r) behave 
independently of each other. Each fragment I u splits after a Exp(l)-distributed random time into 
two fragments: I u q and I u \. Owing to the lack of memory of the exponential distribution, when n 
fragments are present, each of them will split first equally likely. 

We define now the Yule tree process as an encoding of the fragmentation process. The idea is 
to interpret the two fragments I u q and I u \ issued from I u as its two children: I u q is considered as 
the left fragment and I u \ the right one; like this, we obtain a binary tree structure (see Fig. EJ). 
An interval with length 2~ k corresponds to a leaf at depth k in the corresponding tree structure; 
the size of fragment I u is 2~' U L More formally, we define the tree TTf thanks to its set of leaves 

<9Tl 4 = {u, I u G F(t)}. (9) 

We call Yule tree process, the BinTree- valued random process (TTt)t>o- Both processes (TT t )t>o 
and (F(t))t>o are pure jump Markov processes. Each process (TT t )t>o and (F(t)) t >o can be viewed 
as an encoding of the other one, using @ and: 

F(t) = {I u ,u G dTT t }. 

The counting process (N t )t>o that gives the number of leaves in TT t , 

N t := #crrr t , (10) 

is the classical Yule (or binary fission) process (Athreya- Ney [§.). 
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Figure 2: Construction of the Yule tree from the fragmentation. 



Let = ro < ti < T2 < ... be the successive jump times of TT. (or of (F(.)), 

r n = inf{f : N t = n + 1} . (11) 

The following proposition allows us to build the Yule tree process and the BST on the same 
probability space. This observation was also made in Aldous-Shields [*"] section 1, (see also Kingman 
[2*7] p. 237 and Tavare [IU] p. 164 in other contexts). 

Lemma 2.1 a) The jump time intervals (r n — T n _i) n are independent and satisfy: 

T n - T n _i ~ Exp(n) for any n > 1, (12) 
where Exp(A) is the exponential distribution of parameter A. 

b) The processes (T n ) ra >i and (HV n ) n>1 are independent. 

c) The processes (TT Tn ^ n>0 and (%i) n>0 have the same distribution. 

Proof: (a) is a consequence of the fact that the minimum of n independent random variables 
Exp(l)-distributed is Exp(n)-distributed. (6) comes from the independence of jump chain and 
jump times. Since the initial states and evolution rules of the two Markov chains TL Tn and T n are 
the same ones, (c) holds true. ■ 



Convention: (A unique probability space) From now, we consider that the fragmentation 
process, the Yule tree process and the BST process are built on the same probability space. Par- 
ticularly, on this space, we have 

(^)n>0=( T «)n>0- ( 13 ) 

We say that the BST process is embedded in the Yule tree process. We define the filtration (J r t)t>o 
by Tt = a(F(s),s < t). On the unique probability space, the sigma algebra Ft n \ is equal to 
a(F(n),...,F(r n )). 

If we consider the measure valued process (pt)t>o defined by 

Pt= 6 -^g 2 \iu\ = 5 M' ( 14 ) 

uedTt uedTt 
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we obtain a continuous time branching random walk. The set of positions is INq = {0, 1, 2, • • • }. 
Each individual has an Exp(l) distributed lifetime and does not move. At his death, he disappears 
and is replaced by two children, whose positions are both their parent's position shifted by 1. The 
set of individuals alive at time t is <9H\ and the position of individual u is simply \u\. 

This is a particular case of the following general fact: the empirical measure of the logarithm of 
the size of fragments in homogeneous fragmentations with finite dislocation measures is a branching 
random walk (this idea goes back to Aldous and Shields pQ Section 7f and 7g) . 

2.3 Martingales and connection 

The classical 1 family of "additive" martingales associated with the Yule process, parameterized 
by 9 in 1R (sometimes in C) and indexed by t > 0, is here given by 

m(t,0):= 

exp(#|u| - tL(9)), 

uedTt 

where 

L(9) = 2e e - 1 (15) 

(see jl^, [22], and [2] for the fragmentation). For easier use, we set z = e e and then consider the 
family of (J-t,t > 0)-martingales 

M(t,z):=m(t,logz) = ^ ^e t{1 ~ 2z) . (16) 

uedTt 

In particular M(t, 1/2) = 1 and M(t, 1) = e^Nf. 

The embedding formula (fT3|) allows to connect the family of BST martingales {M. n ^( n ))n to 
the family of Yule martingales (M(t, z),J r t)t- If we observe the martingale (M (., z) at the stopping 
times (r n ) n , we can "extract" (Proposition 12.21 below ) the space component A4 n (z) and a time 
component 

%{z):=e^ x -^C n {z). (17) 

Notice that ( ^(z)) is -adapted. 

A classical result (see Athreya-Ney [5] or Devroye |17| 5.4) says that, a.s., e~ l Nt converges when 
t — > +oo, and 

f := lim e^N t ~ Exp(l) . (18) 
Since lim n r n = oo a.s. (see Lemma IXTl a) ) we get from (fTTjl and . 

a.s. limne _Tn =^. (19) 

n 

Proposition 2.2 (martingale connection) Let us assume z G <D\ ~. 
1) The family ( c ^ l {z)) n>0 is a martingale with mean 1, and 

a.s. lim %{z) = (20) 

n 1 [2z) 

Moreover, if$lz, the real part of z, is positive, the convergence is in L 1 . 



1 In general \u\ is replaced by the position X u and L{9) = (5{E f e Bx Z(dx) — 1) where [3 is the parameter of lifetime 
and Z is the offspring point process; here /3 = 1 and Z = 28i. 
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2) The two martingales ( ^(z)) n >o and {■M.n{ z ))n>0 « r e independent and 

M(r n ,z)= ^(z)M n {z). (21) 

Proof: 1) The martingale property comes from Lemma 12.11 a). The Stirling formula gives the 
very useful estimate: 

n 2z-l 

which yields (j20J) owing to (fH?|). 

2) The second claim comes from (|13|) and ()16|). the independence comes from Lemma l2.1l b). ■ 

Proposition 12 . 21 allows us to transfer known results about the Yule martingales to BST martin- 
gales, thus giving very simple proofs of known results about the BST martingale and also getting 
much more. In particular, in Theorem 111. SI 2). we give the answer to the question asked in |25j . 
about critical values of z, with a straightforward argument. 



2.4 Limiting proportions of nodes 

Let us study some meaningful random variables arising as a.s limits and playing an important 
role in the results of Section |21 These variables describe the evolution of relative sizes of subtrees 
in Yule and BST models. 



2.4.1 In the Yule tree 

First, we recall a classical identity: let £ a and be two independent, Exp(l)-distributed random 
variables. We have 

U[0, 1], the uniform distribution on [0, 1]. 



For every ueU, let = inf{£ : u € TTf} be the time (a.s. finite) at which u appears in the 
Yule tree, and for t > 0, set 

Trl u) = {v€V:uv€ TT t+T(u) } 
the tree process growing from u. In particular, set 

For t > t^ u \ the number of leaves at time t in the subtree issued from node u is := N^ u ^ u y 
The branching property and (|18|) give that a.s. for every u £ U 

lime-'ivf^^ , lime-tn^ =Z u e- r(u) , (23) 

where £ n is distributed as £ i.e. Exp(l). Moreover, if u and v are not in the same line of descent, 
the r.v. £ n and are independent. Since, for t > r'"', 

n («) = n («o) +n ( u i) and r («o) =r («i) j (24) 
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a small computation yields 



(uQ) c (ttl) e 

n t n-s^ jj(uO) . = ^0 n f jj(ul) ._ ^ _ ^j(nO) _ Sttl 

which allows to attach a W([0, 1]) r.v. to each node of U. In particular we set 

U := = (26) 

so + 6 

so that 



£:=£ = e- Tl (£o + £i) , ^o = ^e Tl , £l = (1 - U)^ . (27) 

If uO and ul are brother nodes, we have [A nl ) + f/( n °) = 1. We claim that if a finite set of 
nodes v\, . . . ,Vk does not contain any pair of brothers, the corresponding r.v. U^ Vl \ . . . , U^" h ' are 
independent. When none of the Vj is an ancestor of another ("stopping line" property) it is a 
consequence of the branching property. In the general case, it is sufficient to prove that f/W is 
independent of (U^ v \v < u). To simplify the reading, let us give the details only for |u| = 2, for 
instance u = 00. We have, from (|24j) 

£oo rr ( ) (4oo+4oi)e +T 



^(00) = gOO [/(0) 



£oo + Coi ' (£ o + 6n)e- (00) +- (0) + (£io + eii)e- (10) +^ (1) ' 

Actually, from the branching property, £oo and £oi are independent of £io>£ii> 

r (oo) )T (o) )T (io) )T (i)_ 

Moreover since £oo and £oi are independent and Exp(l) distributed, then £;oo/(£oo + £oi) and 
(£oo + Coi) are independent, which allows to conclude that LT( 00 ) and are independent. 
Finally, multiplying along the line of the ancestors of a node u, we get the representation 

a. S .hm^ = n^ ) , (28) 

where the random variables (J7^)-ugu satisfy the claim. 

This is of course related to multiplicative cascade models. See [T5] . 



2.4.2 In the BST 

It is straightforward to see that, by embedding, the property of the above subsection holds true 
for limiting proportions of nodes in the BST, as in oo. 

This property is also known in the LBST (this gives another proof of this property in the BST). 
Let us now sketch the argument for LBST. 

Assume x\ fixed. Consider the tree L n after insertion of the n data X2, • • • , x n+ \. Let K(n) := 
£ [2,n + < x±,} be the number of nodes in the left subtree rooted in 0. Since the Xj 

are i.i.d., U[0, 1], the conditional distribution of K(n) on x\, is a binomial B(n,xi). Hence, by the 
strong law of large numbers, 

K(n) a.s. 
> X\. 

n n 

Now, the subtree to rooted in u = and the subtree t± rooted in u = 1 are "copies" of L n . The 
subtree to is built with the random variables in the list X2,xs, ... , that are smaller than x\ (t\ is 
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build with the ones that are larger than x\). In particular, the label Xt of the root of to is the first 
value among X2, X3, . . . , smaller than x\. It is easy to check that xt is uniform on [0, xi], therefore 
it has the following representation: xt = x\U where U is uniform on [0, 1] and does not depend 
on the value x\. Hence, the asymptotic proportion of nodes in the subtree too is x iU while it is 
xi(l — U) in toi (what happens in the subtree ti is totally independent). 

This iterative construction of the LBST explains why it enjoys the same property as (j28j) in the 
Yule process, and so does the sequence of underlying BST T n . This is a strong, which means a.s., 
version of the analogy between BST and branching random walks, first given by Devroye |17j . 

3 Convergence of martingales 

In this section are given the main results about the asymptotic behaviors of the Yule and BST 
martingales. The martingale connection (Proposition 12.2)) allows to express the links between the 
limits. 

3.1 Additive martingales 

Theorem 13.11 gives an answer to a natural question asked in ^])| about the domain in the complex 
plane where the BST martingale is L 1 — convergent and uniformly convergent. Theorem 13.41 gives 
the optimal L 1 domain on R. 

Theorem 3.1 For 1 < q < 2, let V q := {z : sup 4 E\M(t, z)\ q < 00}. Then V q = {z : f(z,q) > 0} 
with 

f(z,q) := 1 + q(2Mz - 1) - 2\z\ q . (29) 
If we denote V := Ui< g <2V 9; we have : 

a) As t — > oo, {M(t,z)} converges, a.s. and in L , uniformly on every compact C ofV. 

b) As n — > 00, {A4 n (z)} converges, a.s. and in L 1 , uniformly on every compact C ofV. 

Proof: a) is proved in 11_ Theorem 6 (see also [5]). 
b) We will prove 

lim sup Esup\M n {z) - M N {z)\ = 0, (30) 

N n>N z£C 

which implies the uniform L 1 convergence and, since (sup 2gC - \Ai n (z) — M.N(z)\) n >N is a sub- 
martingale, this will imply also the a.s. uniform convergence 2 . From the martingale connection 
f Proposition 12.2)) . we have 

M n {z) - M N {z) = E[M(r n , z) - M(r N , z)\T (n) ] 

so that taking supremum and expectation we get 

Esup\M n {z) - M N (z)\ <E[sup\M(T n ,z)-M(T N ,z)\) . 
zee Vzec / 

2 For the uniform a.s. convergence, it is possible to give a proof directly from |11| 
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Taking again the supremum in n we get 



supEsup|7W n (z) -M N (z)\ < E sup sup|M(r„,z) -M(t n ,z)\ < EA n , (31) 

n>N zGC n>N \zeC J 

where we have set A n := sup T>Tn (sup zeC \M(T, z) — M(r n , z)\). Since M(t,z) converges a.s. 
uniformly, we have a.s. lim n A n = 0. Moreover, by the triangle inequality A n < 2Aq, and by 
the proof of Proposition 1 in [5], Ao is integrable. The dominated convergence theorem gives 
lim n EA n = and (|3()j) holds, which ends the proof of Theorem 13.11 ■ 



Remark 3.2 As usual the L 1 convergence in a) of the above theorem comes from a L q bound (for 
some 1 < q < 2); more precisely, following the steps in ' L 8l section 2.4, the quantity 

A(A) := (M(t,z)-l)e^ z -^ 

satisfies 

E | (3 t (z) \ q < e tq( - mz -V f exp (sf(z, q)) ds for 1< q < 2. (32) 

Jo 

Theorem 3.3 Let us assume z 6 (z~,z+). 

1) We have the limit martingale connection : 

c2z-l 

a.s. M(oo, z) = |— - Moo(z) , (33) 

where the exponential variable £ is defined in 

2) We have the following two splitting formulas: 

a) for the Yule process, 

M(oo, z) = ze (1 - 2z)ri (M (cx), z) + Mi(oo, z)) a.s. (34) 

where Mq(oo, z) and Mi(oo, z) are independent, distributed as M(oo, z) and independent 
ofn. 

b) for the BST, 

Maoiz) = z {U^M^iz) + (1 - Uf^M^z)) (35) 

where U ~ £/([0, 1]) is defined in \2b}) . M.oc,(o)( z )> M-^i^z) are independent (and in- 
dependent of U) and distributed as A4oo(z). 

Proof: 1) is a consequence of ()20|) and the martingale connection (|21|) . 
2) a) For t > t\ we have the decomposition 

M(t, z) = ze^- 2z ^ \m<® (t- n ,z) + M« (t - n, z)] (36) 

where for i = 0, 1 



M { i\s,z)= zW ^ eS(l ~ 



-2z) 
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and H 1 ® is defined in Section \ZA\ 

b) Take t = r n in (|36j) . condition on the first splitting time t%, apply the branching property, 
let n — > oo and apply the limit martingale connection (|33|) to get 



;2z-l / t2z-l a2z-1 \ 

MooW = a^ 1 " 2 ** ^K )(0) (z) + Si Moo>(1)(z )J (37) 



T(2z) 

where £o anci £1 come from section |2~H which yields b) with the help of (|27|). ■ 
The following theorem gives the behavior in the remaining cases 

Theorem 3.4 For z 6 (0, oo) \ (z~ , z£), then a.s. lim t M(t, z) = and lim n .M n (,z) = 0. 

Proof: The continuous time result is in (see also [5]); it remains to use again the martingale 
connection (1211). ■ 



3.2 Derivative martingales 

From the above section, we deduce that the derivatives 

M f (t,z):=-^-M(t,z), M' n (z) := ^M n {z) (38) 

are martingales which are no longer positive. They are called the derivative martingales. Their 
behaviors are ruled by the following theorem. 

Theorem 3.5 1) For z £ (z~, z£), the martingales (M'(t,z),t > 0) and (M' n (z),n > 0) are 
convergent a.s.. Let us call M'(oo,z) and M' 00 (z) their limits. 

a) We have the (derivative martingale) connection: 

M'(oc> z) = ^ (MM + 2 (logC - ^!||) MM) a.s. (39) 

where £ ~ Exp(l) is defined in and is independent of Moo(z) and M'^z). 

b) We have the splitting formula: 

MM) = zu^M^M + ^-uf^^'ooM 

+ 2z (U 2z ~ l log U) MM Z ) + 2z (C 1 " u ?*~ l Ml " u )) ^oo,M 
+ z- l MM (40) 

where U ~ W([0, 1]) is defined in \2b)) . and the r.v. M'^ ^(z) and M'^ ^(z) are 
independent (and independent ofU) and distributed as M' OQ (z). 

2) a) The martingales (M'(t,z~),t > 0) and {M' n (z~),n > 0) (resp. (M'(t,z+),t > 0) and 
(M' n (z+),n > 0)) are convergent a.s.. Their limits denoted by M'(po,z~) and M'^z') 
(resp. M'(oo, z£) and M' OQ (z^)) are positive (resp. negative) and satisfy 

E(M'(oo, z-)) = E(Moo(*7)) = ( 41 ) 
E(M'(oo, z+)) = E(MM)) = -oo. (42) 
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b) M'{oo,zf) and M'^z^r) satisfy equations similar to KSty) . \34\l an d $EW : 

e 



M'(oc,z±) 



T(2zl 



M'(oc,zt) = zte^- 2z ^(M^oc,zt) + M[(oo,zt)) 
MU*t) = ^(u^-'M'^iz^ + il-Ur^-'M'^t) 



a.s. . 



(43) 

(44) 
(45) 



Proof: 1) For z G (z~,z^) the a.s. convergence of M'(t,z) is a consequence of the uniform 
convergence of M(t, z) (by Theorem 13. 1|) and analyticity. Taking derivatives in the martingale 
connection (|21|) gives 



M'(T n ,z) 



C' n {z) 
C n (z) 



2r n 



;(z)M n (z)+ %{z)M' n {z) 



(46) 



Using (j!9|) again and 



n-l 

£ 



j + 2z ' r(x) 



71 — 1 j 

lim I log n — > 

n V x + 7 



we get 



a.s. lim 



C n (z) 



2t„ 



r'(2z) 

T(2z) 



3=0 



+ log<£ 



(47) 



We conclude that M! n {z) converges and that Ai'^z) satisfies (|3"9*|) which proves a). 
To prove b), we differentiate (|36jl with respect to z 

M'(t,z) = (z- 1 -2n)M(t,z) + ze {l ~ 2z)Tl \m^' \t - n, z) + M«' \t - n, z) 

and we use the same technique as above: take t = r n , let n — > oo, apply ()39j) and its analogs with 
(M'W, jW®, jW'W,&)i=0,i instead of (M', M, M', £), and use (j27J). 

2) For z = 2g , the a.s. convergence of the martingales M'(t,z) and the signs of the limits are 
proved in [5], and so is the relation 

EM'(oo, z-) = -EM'(oo, z+) = oo . 

Relation (|4*3"|) is a consequence of and l|47|l. since A^oo(^) = 0. 

Formula (|45|1 of 2) is straightforward from (|40|) since .Moo^) = 0. Formula l|33 |l is (|33jl for 



An easy but interesting consequence of (|4UI) is obtained in the following corollary, just taking 
z = 1 in (|39|) and (j40j) (remember that Ai n (l) = 1). The distributional (weaker) version of (|49|) 
below is the subject of a broad literature (see for instance Fill, Janson, Devroye, Neininger, Rosier, 
Riischendorf EHl QH E21 03 OH]) and some properties of the distribution of M'^il) remain 
unknown. 
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Corollary 3.6 We have 

M'(oo,l) =e(^'oo(l) + 2(loge + 7-l)) a.s., (48) 
where 7 is the Euler constant, and M'^l) satisfies the a.s. version of the Quicksort equation: 
ACU) = UM'^il) + (1 - U)M^ {1) (1) + 2U\og U + 2(l- U) log(l - U) + 1 , (49) 

where as above, M'^ (o)(l) anc ^ -^[>o (ljW are independent (and independent of U ), distributed as 
-M'ooCl) ana" £/~ W([0,1]). 

4 Convergence of profiles 

4.1 Random measures and profiles 

Recall that the profile of the tree T n is the sequence 

U k {n) = #{u e dT n : \u\ = k}, k > 1 , 

and that, according to Q, for every e > 0, there exists a.s. uq such that for n > no, 

U]c(n) = for k ^ [(c' — e) log n, (c + e) log n] . 

It means that the convenient scaling for k is (logn) -1 . We are interested in the asymptotic behavior 
of Uk{n) for k = xlogn and x fixed in (c',c). It is well known that 

HU k (n)) = ^ 

where is the Stirling number of the first kind. By analysis of singularities, Hwang ( 24 ) got 
an asymptotic estimate; for any I > as n — > 00 and k — > 00 such that r = kj log n < I : 

EU k (n)= (2 /°^ (l + o(l)). (50) 
A;! n I (r) 

We deduce easily that for any t > : 

l-m( k ) 

Jl ' V log 72 ' 

E ^")- r(Efe) ^ a + °(D). (5i) 

where o(l) is uniform for A;/ logn < £ and 772 was defined in (j3J. 

Jabbour in [23] introduced the random measure counting the levels of leaves in T n 

r n := u k(n)S k = ^ 5\ u \ . 

k uddT n 

He proved that for x € (2, c) 

a.s. lim logr n (]xlogn, 00 [) = 1 — 772(2;) (52) 

n^oo log n 
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and that the same result holds for x £ (c',2), replacing ]xlogn, oof by ]0, x\ogn[ . 
At the level of random variables, Jabbour & al. proved in |15j that 

a.s. lim sup ( JM!^ - Moo (-^)) =0. (53) 
n fc/togne[l.2, 2.8] \®U k (n) K 2\ogn' J 

Since their approach relies on I? estimations of M n {z) with z = fc/(21ogn), they guessed that the 
range [1.2, 2.8] may be extended to I := (2 - 2 1 / 2 , 2 + 2 1 / 2 ) = (0.585... ,3.414...) which corresponds 
to the maximal interval in z of L 2 convergence. In the following subsection we extend the validity 
of the above result to (compact subsets of) the entire interval (c',c) = (0.373... ,4.311...). 

This type of result is very reminiscent of sharp large deviations in branching random walks 
f [TUj. [TT] . [iT]^. Actually, we use the embedding method and results on the Yule process, as a 
branching random walk in continuous time. The random measure counting the levels of leaves in 
the Yule tree is 

pt = ^2 6 \ u \ ' 

uedTt 

(recall (|14|l). With the notations of |41j . the exponential rate of growing is ruled by the function 

x i ^ L*(x) := sup6*x — L(9) = 772(2?) — 1 , 
e 

where the function L is defined in (|15j) . This allows to define three areas: 

- for x g]c',c[ , r]2{x) < 1, so there are in mean about r > 2 ^) t leaves at level ~ xt. Call this 
interval ]c',c[ "supercritical area". 

- for x 6 [0, c'[U]c, 00 [, rj2(x) > 1, so there are in mean about e^ 1 ~ ri2 ^ t leaves at level ~ xt. Call 
this set "subcritical area" . 

- call the set {c',c} "critical area" . 

More precisely (Theorem 1' p. 909 f° r x i n t ne supercritical area, 



lim Vt e tL * {x) Pt ([xt}) = J ^lZM. M (oo, x/2) a.s. . (54) 
t— >oo V 2tt 

It is now tempting to replace t by r n and pt ( [xt] ) by p Tn ( [x log n] ) = r n ([x log n] ). To validate 
this, we need some uniformity in x in (|54|) . In 11 , Biggins obtained such a result. However, it was 
in the non-lattice case, so we give in the next subsection a complete proof. 

4.2 Main result 

The aim of this section is to prove the following result 
Theorem 4.1 Almost surely, for any compact subset K of (c' , c) 

lim sup ( f fc(n) NN -Moo(-^)) =0. (55) 
- fc:(fc/iog„)e* W^W) V 21ogn^ 

Proof: The following lemma, whose proof is postponed yields an asymptotic uniform behavior 
for pt(k). 

Lemma 4.2 Almost surely, for any compact C of (z~,z^~), 

lim sup z k Vte t{1 ' 2z) \pt(k)-M(oo,z)e- t ^-] =0. (56) 
k>l,zec kl 
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Let C be a compact subset of (z c ,z+). From Lemma POl we know that 



Pt {k) = M(oo,z)e 



k\ 



+ o(l)z- k t-^ z e 



k.-l/2-t(l-2z) 



Recall that o(l) is uniform in k and in z E C. If V^' stands for the Poisson law with parameter A, 
notice that a appears in the previous expression. Using a change of probability from to 
T>( 2tz \ we get 

p t (k) = «- fc t-V2 e -*(i-2*) |iV 2 M (oo, z)V^ 2tz) ik) + o(l) 
Using the local limit theorem |33| . we have 



lim sup 



V2^xr {x) (k) 

Now, we set A = 2tz with z 6 C which yields 



2A 



lim sup sup 

^ 00 zee k 



\[^Tz V {2tz \k) -exp ( 



(A; - 2tz) 5 
iti 



0. 



Hence, 



with 



p t (k) = A t (k,z) 



exp i - ^ 4 t 2 ^ 2 ) M(0 °' z) + (( 47rz ) V2 + M (°°' z ) ) °< 1 : 



(57) 



A t (k,z) :-- 



-t(l-2z) 



Z k (4TTtz) 1 / 2 



Remembering that Uk(n) = p Tn (k), we take t = r n and z 
the estimate (|51[). we get 

A-„(M) 



2 log n 



in l|57jl. Using (|T^|) again and 



[EC/ fc (n)] e~ 2z r(2z) ~ 1 ' -'V 4r n z 

Now we apply the limit martingale connection (|33|) and notice that 

sup ((Airz) 1/2 + M(oo, zj) < oo 

and we conclude 



C4(n) = [EC4(n)].M 00 (z)(l + o(l)) , 
with z = k/(2logn) and o(l) uniform in z G C. 



16 



4.3 Proof of Lemma 14.21 

We use the following lemma, which is the continuous time version of Lemma 5 in . Its proof 
can be managed with the same arguments, replacing Lemma 6 there, by Remark 13 .21 We omit the 
details. 

Lemma 4.3 For any zq G (z~, z£) there exists r > for which z~ < z$ — r < zo + r < z+ and 
such that a.s. 

lim sup [ Vt | M(t, ze ir >) - M(oo, z) \ e ~ 2t < 1 -^ v) dr] = _ ( 58 ) 

ze\zn-r,zn+r] J-n 



1 z€[z -r,z +r] 

Write 



M(t,z) = e t ^Y,Pt(k)z k , 



and the Fourier inversion formula yields 



-t(l-2z) -k fTT 



pt(k) = - — / M^ze^e-^-^e^drj 

and, owing to Lemma 

2itp t (k)e t ^- 2z) z k ^t = M(oo, z)V~t ^ e-^-^e-^drj + o(l) 

J — TT 

with o(l) uniform in k and in z in any compact subset of (z~, z+). Now, from the Cauchy formula 
we get that 



e -2 z t(l-e^) e -ik VdrJ = 2 ^2zt 



(2zt) k 



k\ ' 

yielding (|56|). which ends the proof. ■ 

5 Tagged branches and biased trees 

We now introduce the tilting or biasing method in the setting of the Yule and BST processes. 

This procedure consists in marking at random a special "ray" or branch of the Yule tree. The 
special ray of the BST is the spine of the marked Yule tree observed in the splitting times. One 
then applies to this special ray some evolution rule, different from the other branches. The result 
of this change of probability is that the whole tree owns a different behavior. This method is usual 
and fruitful in modern developments on branching processes, and also in the study of fragmentation 
processes 1141 ISO] ). The introduction of the tilting method in the setting of BST provides new 
tools to study some characteristics of the BST. 

The martingales studied above are the right change of probability to pass from a tilted model to 
the non-tilted model: they appear as Radon-Nikodym derivatives. The parameter z, present in the 
martingales (M(t, z))t>o and (A / f n (z)) n >o, allows to tune the growing of the special ray, changing 
in a visible way the shape of the (Yule or BST) tree. 
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5.1 Tilted fragmentation and biased Yule tree 

First at all, let us enlarge the probability space of the fragmentation process introduced in 
Section \2.2l Let us denote by J~t the cr-algebra of the interval fragmentation process F(.) up to 
time t and V be a U([0, 1]) r.v. independent of the filtration (.Ft) t> Q. 

Since P(V G {k2~i , < k < 2 j , j G N, k G N}) = 0, we may define P-a.s. for every t a unique 
S(t) G U such that Ig( t ) is an interval of F(t) and V G JW) . In other words, S(t) is the element of 
U encoding the fragment containing V, its depth is s(t) := | the length of IgM is 2~ s ^ and 

P(5(t) = u I = 2~l w l , u G 91Tf (59) 

(it is equivalent to choose a fragment at random with probability equal to its length, it is the 
classical size-biasing setting). 

Now we build the process (TTt)t>o of marked binary Yule trees associated with the pair (F(.), S(.)). 
The only change with Section [2,21 is the role played by the random variable V (missing in Section 
12.2(1 . During the construction of the Yule tree, at any given time t, each leaf in TTj corresponds to 
an interval in the fragmentation F(t). For every t we mark the leaf S(t) of TTj that corresponds to 
the interval Is(t) that contains V. We obtain a marked tree called (TTt)t>o- Thus, the set of nodes 
marked during [0,i] are the prefixes of S(t). We call spine the process S(.). 

In fact, given TL t , one can recover (F(t), S(t)). Moreover, with the whole process (TT t ) 4 >o one 
can a.s. recover V: 

t>0 

As a consequence of the general theory of homogeneous fragmentations (see Bertoin [Jj) or by 
a direct computation, we see that (s(t),t > 0) is an homogeneous Poisson process with parameter 
1. In particular, if 

£(t,z) := (2z) s ^e t{l - 2z) (60) 

then E £ (t, z) = 1. Conditionally on T r = T r V a(S(r), s < r), the restriction of the fragmentation 
F(. + r) to the interval Ism ^ s distributed as a rescaling of F(.) by a factor 2~ s ^ r \ which entails 
that (£(t, z),J r t) t>0 is a martingale. By the size biasing scheme (|59")) and the definition (|TB|) we get 

M(t,z) = E [£(t,z) I T t \ ■ (61) 

Hence, the Yule martingale appears to be a projection of the martingale £ (which is a spine- 
measurable function) on the cr-algebra containing only the underlying binary tree. 

Coming back to the discrete time, set Spine„ := S(r n ) and s n := |Spine n |. Notice that the 
underlying unmarked tree TT Tn is J-"r n \ -measurable. 

Applying (|59|) at the (!Ft,t > 0) stopping time r n , we get for every leaf u G dT n (and k > 1) : 

P(Spine„ = u \ F {n) ) = 2~W, (62) 
F(s n = k I F {n) ) = U k (n)2-\ k \ . 

Thus, for fixed n, to draw at random the marked tree TT Tn , one may choose at first a binary tree 
T n , and then pick the marked leaf according to the conditional distribution (|62|) . Let J-iq\ be the 

trivial cr-algebra, and for n > 1 let be the cr-algebra obtained from Ti n ) by adjunction of 
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S(ti), ...,S(r n ). Let us consider £ n {z) := E £(r n ,z) \ F(n) (with £q{z) := 1). Prom Lemma l2~Tl a) 
we have E(e Tn ( 1-2 *)) = C n (z)~ l hence 

£ n (z) = (2z) s "C n (z)- 1 . (63) 

From the martingale property of £(t,z) and the definition of £ n (z) we see that (£ n (z),J-f n -\) is a 
martingale. Like in (|61|). we get easily 

M n (z)=E[S n (z)\T (n) ], (64) 

so that the martingales M(t, z) and A4 n (z) are obtained from the "exponential martingales" £(z, t) 
and £ n (z) by projection. 

Moreover the martingale connection ()21j) may be seen as the projection on Fin) of the relation 

£{T n ,z) = %[z)£ n {z) . 

Note that one may also obtain M n (z) as 

M n (z) = E(M( W )|^ (n) ); 

this is a kind of integration with respect to the time. All these martingales are precisely the main 
tool to tilt probabilities. In particular we define P^ on (T t ,t> 0) by 

pf 2 / } =£(t,z) P, (65) 
By projection on (.T^i > 0), yields 

pg*) = M(t, z) ¥\ Tt . (66) 

If d ¥ (resp. d p( 2z )) is the restriction of P (resp. p( 2 ^)) to V n J-^, the discrete versions of the 
above relations are 

M 2z) =£ n (z) d F, , d wf z) =M n (z) d ¥ lT . (67) 

It turns out that P (2z) can be seen as a probability on marked Yule trees. This is the object of 
the following subsection. 



5.2 A biased Yule tree 

Recall the construction of the Yule tree process (TLt)t>o given in Section \2. 21 Each leaf u of the 
current Yule tree owns a Exp(l)-distributed clock. At its death, u becomes an internal node, and 
two leaves uO and ul appear (with new Exp(l), independent of the other ones). 

Let us consider now a model of marked binary tree (TT^ )^>o defined as follows. 

In TP* there are now two kinds of nodes: marked and unmarked. We denote by (v, m) the node 
v if it is marked, and by (v,fh) the node v if it is unmarked. At time 0, TTq = {(0,m)}. 

Each unmarked leaf owns a Exp(l)-distributed clock. The marked leaf owns a Exp(2z)- 
distributed clock. Now the evolution of the tilted Yule tree is as follows: 

• when an unmarked leaf u dies, u becomes an unmarked internal node, and two unmarked leaves 
(u0,m) and (ul,fh) appear. 
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• when the marked leaf u dies, u becomes a marked internal node. Two leaves uO and ul appear. 
We mark at random uO or ul (equally likely) and let the other one unmarked. 

The marked nodes form a branch in the tree. The behavior of the marked branch depends on the 
value of 2z. If 2z > 1 then, the growing of the marked branch is faster than the other ones, when 
2z < 1, the growing of the marked branch is slower. The depth of the marked leaf follows a Poisson 
process of rate 2z. Notice that we have already met this Poisson process in the proof of Theorem 

EH 

It turns out that under f( 2z \ the process (Tl^)^ has the same distribution as the process 
(Tl*)oo (consider the spine as the marked leaf). For the underlying branching random walk this 
construction is classical Pj.jl4j.pU] For the fragmentation it can be found in [5]. 

Remark Denote by S*(t) the marked leaf in TT*, and consider 

t>0 

By symmetry of the splittings, the random variable V* is U[0, ^-distributed and independent of 
the process of the underlying unmarked trees derived from (TT*)oo. Note F*(.) the fragmentation 
process that is associated with the unmarked process derived from TT£. 

The a.s. bijection between (F(.),S(.)) and (F(.),V) (under P^ 1 )) explained in the beginning 
of Section 15.11 is also valid between (F* (.) , S* (.)) and (F*(.), V*). Hence, the law on marked 
fragmentation (F(.), S(.)) under p( 2z ) (defined by is the law of S*(.)). It follows that, 

under p( 2z ) one may also build the spine by choosing at first a uniform random variable V and follow 
the fragment containing V. This is not true in general when using the tilting method. Usually, at 
each splitting of the marked fragment M, one has to choose the new marked fragment among the 
children of M, according to a rule depending on the size of these fragments. It cannot be summed 
up by the drawing of a random variable V, once for all as in our case, where sizes are equal. 

According to the representation by (TL*) t >o, the Yule tree owns a natural decomposition according 
to the marked branch. Let u be a node of the marked branch. One of the nodes uO or ul does not 
belong to this marked branch. Assume that it is uO. Then, (up to a change of the time origin), 

• the subtree rooted in uO is a copy of the untilted Yule tree; 

• the subtree rooted in ul is a copy of the tilted Yule tree. 

We can also see this process as a branching process with immigration, as presented in [3U] (see also 
EH chap. 10 and j2U). 

5.3 A biased BST model 

The tilted Yule tree can also been stopped at time r n of the creation time of the nth internal 
node. Let T n be the obtained marked binary search tree. The discrete evolution is as follows: 
T n is a complete binary tree with 2n + 1 nodes, in which one leaf is marked and the n other ones 
are unmarked. Knowing T n , the marked tree T n+ i is as follows: 

we choose the marked leaf with probability 2z/{n + 2z) and each unmarked one with probability 
l/(n + 2z). 

• If the chosen leaf v is unmarked, then v becomes an unmarked internal node and two unmarked 
leaves vO and vl are created. 

• If the chosen leaf v is marked, v becomes a marked internal node. Two leaves vO and vl appear. 
One marks at random vO or vl (equally likely) and let the other ones unmarked. 

We note Q( 2 ^ for the law on the marked binary search tree process (F n ) n under this model of 
evolution. 
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Once again, the BST can be decomposed along the marked branch. The speed of growing of 
the marked branch depends on the value of 2z. One may also interpret the size of the subtrees 
rooted on the tilted branch as tables in a Chinese restaurant (see Barbour & al. 2 , Pitman [34 J, 
and obtain, like this, new explanations of the behavior of the size of the subtrees rooted on the 
marked branch. 

As in the previous subsection, we denote by (v,m) a marked node and (v,fh) an unmarked 
node. The dynamics we described above yields the following conditional probabilities: 
if (vfh) E dT n , then 

Q( 2z )(Spine n+1 = Spine n ,T n+1 = f n U {(vO,fh),(vl,fh)}\f n ^ = 



n + 2z 

If (v,m) G dT n , (i.e. Spine n = v), then 

Q( 2z )(Spine n+1 =vO,f n+1 = f n U {(vO,m),(vl,m)}\f n ) = 

In + Iz 

similarly, 

Q^(Spine n+1 = vl,f n+1 =f n U {(vO,fh),(vl,m)}\f n ) = ■ 

Summing up, we have for any marked tree t n+ \ with n + 1 nodes that can be obtained from T n by 
one insertion 

z > (T n+1 = t n+l \T n ) = — - (68) 

n + 2z 



and 

qW(t;+i = t n+1 \f n 



(1/2) 



n + 1 



Thus, by iterative construction, 



(2z) 1-1 



(1) 



n w^ia. w , c , (!) -.. 4(!) . 



f, fJ- j + 2z 



Hence, (Q)( 2z ) is absolutely continuous with respect to with the Radon-Nikodym derivative 

announced in (|67[). Since and d pW (the non-biased models) are identical, the law of (T n ) n 
under Q( 2 ^ is rf p( 2z ). 

One finds an analogous result (in another context) and its proof in Lemma 1 and 2 of |14j . 

5.4 Spine evolution 

Thanks to the previous subsections, it appears that under d f( 2z ) 

n-1 

s n = l + ^e k (69) 
i 

where (ek)k>i are independent and for every k > 1, €\. is a Bernoulli random variable with parameter 
(we use the notation e k ~ Be(^j)). 

Proposition 5.1 For any parameter z > 0, 
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1) (strong law) 



l im = 2z , -a.s.. (70) 

logn 



2) (central limit theorem) The distribution of - under d ip( 2z ) converges to a standard 

\j2z\ogn 

normal distribution AA(0, 1). 

3) (large deviations) The family of distributions of (s n , n > 0) under d f( 2z ) satisfies the large 
deviation principle on [0, oo) with speed logn and rate function r]2 Z where the function rj\ is 
defined in (TJJ). 

Proof: 1) and 2) are consequences of known results on sums of independent r.v. (see [SHI)- Notice 
also that s n — K^ 2z \s n ) is a martingale. 

3) is a consequence of Gartner-Ellis theorem. ■ 



Once again, this proposition shows that under the biased model, the BST evolves rather dif- 
ferently that under the usual model. For example, the marked leaf depth is about 2z log n. So, for 
z > z^ , the marked leaf is higher that the height of the non-biased BST. 

5.5 Depth of insertion 

In introducing the BST model, we defined the sequence (D n ,n > 0) as the successive inserted 
nodes and d n = \D n \ (see Q). In continuous time, we set rj{t) = infjs > t : TT S ^ TT t } for the 
first time of growing after t, and D(t) = T^r)(t) \ f° r the node of the coming insertion. 

Let us stress on the difference between the spine processes (s n ,n > 0) and (s(t),t > 0) and the 
insertion processes (d n ,n > 0) and (d(t),t > 0). 

The (marginal) distribution of d n is given in Jabbour [25] ( see a l so Mahmoud |31| ) 

Ez d n = Cn(z)_ = (2z)(2z + l)---(2z + n-l) 

n + 1 (n + 1)! { ' 

so that 

n-1 

, law -. 

a n — 1 + £ k i 
1 

where (£k)k>i are independent and for every k > 1, ~ Be(^2) . 

Proposition 5.2 The following convergences hold: 
(i) (d n ) satisfies a law of large numbers: 

d„ 



2 log n 

(ii) it satisfies a central limit theorem: 

d n -2 log n 
\J1 log n 

(Hi) We have 



1; (72) 



M(0,l). (73) 



a.s. liminf - n — = — = z , a.s. lim sup — = — = z + . (74) 

n 21ogn 2 ' „ F 21ogn 2 v 1 
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Note that (Hi) of course, implies that (ii) is not an almost sure convergence. 

Proof: The arguments to prove (i) and (ii) are classical; (Hi) is a consequence of (J5J). 



For the Yule tree, we did not find the distribution of d(t) in the literature. Let us give the joint 
distribution of (N t ,d(t)) (for £ fixed). 

Since {N t = n + 1} = {r n < t < r n+1 }, we have E(z d ^s Nt ) = £g°(Ez d ")P(A^ = n + l)s n+1 . 
Since the distribution of Nt is geometric of parameter e - *, and owing to (fTTj) we get 

E{z d(t) s N t) = J- {2Z)(2Z + { l Z + n ~ l) e~\l - e- l ) n s n+l (75) 
- 1 ; -. (76) 



(e* - l)(2z - 1) 
Taking s = 1, we get the marginal of d(t) 



Ez d{t) 



e t(2z-l) _ 1 



(e* - l)(2z - 1) 

Transforming these generating functions into Fourier transforms, it is now easy to conclude that 
Proposition 5.3 As t — ► oo, 



_ t d(£) -2t\ i„ 



N t e-\^=^)^(t,G) 



where £ is defined in \19\) and G is M(0, 1) and independent of £. 
Remark: For the same reasons as in (|74[). we have 

a.s. hminf^ = ^=,- , a.s. limsup *® = °- = z+ . 
t 2£ 2 ' t 2t 2 

Under the change of probability p( 2z ) (or using Kolmogorov equations) the distribution of Nt is 
given by: 

E (2*)^v t = E [(2^)»(*) e *(i-2*)^V*] (77) 

oo 

= e^ 1 " 2 ^ + ^(E(2z) s ")P(^ = n + 1)<T +1 ] (78) 
o 

oo n—2 . _ 

= e^i-^^ + e-^fl-e-^-iJjlIJ^^] (79) 



1-e- 



2i- 



(80) 



where is any real in [0, 1]. Hence, under f( 2z \ the r.v. Nt — 1 is a negative binomial of order 
2z and parameter e~ l . As £ — > oo, the p( 2 ^) distribution of e~*iV t converges to a 7(2z)-distributed 
random variable. Actually we have for every z, £, /i 

E^t+ftl^i] = (iV t - l)EiV h + E^JV), (81) 
= e h (N t -l) + l + 2z(e h -1). (82) 
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This implies that (e _ *[A r t — 1 + 2z]) t >o) is a p( 2z ) martingale. If 2z > 1, it is positive hence 
convergent P^ 2z ) a.s. If 2z < 1, then (e - * N t )t>o) is a positive supermartingale, hence P^ 2z ) a.s. 
convergent. ■ 
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